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(57) Abstract 



A nucleic acid sequence which codes upon expression in a prokaryotic or eukaryotic host cell for a polypeptide product having one or 
more properties selected from (i) the ability to adhere to a mammalian tooth in a competitive manner with naturally occurring Streptococcus 
mutans antigen I/O, thus pre venting or diminishing the adhesion of S-mutans to the tooth; (ii) the ability to stimulate a T-cell response; and 
(iii) the ability to stimulate a B-ceU response, said nucleic add sequence being selected from: (a) the sequences shown in SEQ. ID. No. 12 
to 22 or the complementary strands thereof, (b) nucleic add sequences having a length of not more than 1000 base pairs which hybridise 
to the seq u e n ces defined in (a) over at least 70 % of their length; (c) nucleic acid sequences having a length of not more man 1000 base 
pairs which, but for the degeneracy of the genetic code, would hybridise to the nucleic add sequences defined in (a) or (b) over at least 70 
% of their length and which sequences code for polypeptides having die same amino acid 
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POLYPEPTIDE FRAGMENTS CAPABLE OF COMPEimON WITH STKEPTOCOCCUS MUTANS 
ANTIGEN IK 



This invention relates to polypeptide fragments of 
the Streptococcus mutans I/II antigen that are useful in 
5 treating and preventing dental caries. 

Streptococcus mutans is the main etiological agent of 
dental caries, a disease which affects mammals including 
humans . 

The £^ mutans l/II antigen (SA I/II) is a cell 

10 surface protein with an My of about 185kDa. It is believed 
to comprise several antigenic epitopes and to be at least 
partly responsible for S-. mutans adhesion to teeth. 

SA I/II is described in British Patent No. 2,060,647, 
as are number antibodies to it. A putative 3.5 to 4.5 JcDa 

15 fragment of SA I/II, "antigen X", has also been described 
in European Patent No. 0 116 472. 

However, it has now become clear that "antigen X" iF 
not a fragment of SA I/II at all. Rather, it is a separate 
protein that merely co-purifies with SA I/II. It is 

20 believed to be encoded by a separate gene. 

Two large fragments of SA I/II, an N- terminal 
fragment (residues 39 to 481) and a 40kDa central fragment 
(residues 816 to 1213) are recognised by human serum 
antibodies. Within the central fragment, 80% of the sera 

25 tested recognise elements within a proline-rich region 
(residues 839-955) that comprises three tandem repeats. 
This suggests that this region includes one or more B-cell 
epitopes. The central fragment (residues 816-1213) is also 
believed to comprise one or more adhesion sites that 

30 mediate £L_ mutans ' attachment to the tooth. 

The aim of the above-mentioned work has been the 
development of vaccines for immunisation against dental 
caries. However, precise identification of the antigenic 
epitopes within SA I/II is a prerequisite for designing 
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synthetic vaccines based on it. Similarly, precise 
identification of adhesion sites is essential for the 
design of drugs against dental caries that rely on 
inhibiting S. mutans ' adhesion to the tooth. 
5 No antigenic epitopes (T-cell or B-cell epitopes) or 

adhesion sites within SA I/II have been characterised, nor 
has the precise location of any such regions been 
suggested. Also, there has been no indication of the 
location of S. mutans ' T-cell epitopes as the above- 

10 mentioned work has concentrated on S . mutans ' ability to 
adhere to teeth and to generate a B-cell response. 

The inventors have identified a number of T-cell 
epitopes, B-cell epitopes and adhesion sites within 
residues 803 to 1114 of SA I/II. Some of the T-cell and B- 

15 cell epitopes overlap or are contiguous with each other 
and/or with one or more of the adhesion sites. 

The presence of a number of antigenic epitopes of 
both types and a number of adhesion sites within the same 
region of SA I/II could not have been predicted and the 

20 finding that some of the adhesion sites and epitopes 

overlap or are contiguous with each other is particularly 
surprising. 

These findings make it possible to design effective 
synthetic vaccines against dental caries as well as drugs 

25 that engender resistance against the disease or alleviate 
pre-existing cases of it by preventing S. mutans ' adhesion 
to the tooth. Further, the surprising finding that some of 
the T- antigenic epitopes and the adhesion site are 
contiguous or overlapping makes it possible to design 

30 bifunctional drugs that effect immunisation against dental 
caries as well as preventing adhesion of g. m^ftP9 to the 
tooth. 

Accordingly, the present invention provides a nucleic 
acid sequence which codes upon expression in a prokaryoic 
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or eukaryotic host cell for a polypeptide product having 
one or more properties selected from (i) the ability to 
adhere to a mammalian tooth in a competitive manner with 
naturally occurring Streptococcus mutans antigen I/II, thus 
5 preventing or diminishing the adhesion of S. mutans to the 
tooth; (ii) the ability to stimulate a T-cell response; and 
(iii) the ability to stimulate a B-cell response, said 
nucleic acid sequence being selected from: 

(a) the sequences shown in SEQ. ID. Nos. 12 to 22 or 
10 the complementary strands thereof; 

(b) nucleic acid sequences having a length of not 
more than 1000 base pairs which hybridise to the 
sequences defined in (a) over at least 70% of their 
length; 

15 (c) nucleic acid sequences having a length of not 

more than 1000 base pairs which, but for the 
degeneracy of the genetic code, would hybridise to 
the nucleic acid sequences defined in (a) or (b) over 
at least 70% of their length and which sequences code 
20 for polypeptides having the same amino acid sequence 

code, would hybridise to the nucleic acid sequences defined 
in (a) or (b) over at least 70% of their length and which 
sequences code for polypeptides having the same amino acid 
sequence . 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Depiction of the panel of overlapping 20mers used 
to map T-cell, B-cell and adhesion epitopes within SA I/II. 

30 

Figure 2. Proliferative responses to overlapping synthetic 
peptides (20*") of SA I/II. 

a; Mean S.I. (±sem) of PBMC from 30 subjects. Mean cpm 
with medium only was 538+112. 



WO 96/23886 



PCI7GB96/D0207 



4 

b) Frequency of positive responses (S.I.>3.0, cpm>500) . 

Figure 3. MHC class II dependency of proliferative 
responses to SA I/II. 

5 

Figure 4, Serum recognition of SA I/II and recombinant 
polypeptide fragments. Western blots from 3 subjects are 
shown (panels a-c) together with rabbit anti-SA I/II 
antiserum (panel d) . Lanes, 1, SA I/II; lane 5, 
10 recombinant 984-1161. 

Figure 5. Human serum recognition of synthetic peptides of 
SA I/II. 

15 a) Titres were determined by ELISA in 22 subjects to 
selected peptides of SA I/II and an irrelevant 
control peptide from SIVp27(SIV). The frequencies of 
sera binding the peptides with a titre >mean+2S.D. 
the control peptide are also indicated. 

20 

Figure 6. Inhibition of adhesion of iL. mutans . 

a) SA I/II and recombinant fragment 984-1161. 

b) Synthetic peptides. 

25 Figure 7. Proliferative responses of murine splenocytes 

following immunization with recombinant 975-1044 (SEQ. ID. 
No. 8) 

Figure 8, Competitive inhibition of SA I/II binding by 
30 various polypeptides. 

Figure 9, Dependence of competitive inhibition of SA I/II 
binding on concentration of two peptides. 
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Figure 10. Effects of substitution of certain residues on 
competitive inhibition. 

Figure 11. Comparison of various recombinant polypeptides 
5 with respect to binding. 



The polypeptides of the invention have one or more of 
the following properties. Firstly, they may have the 
ability to adhere to a mammalian tooth in a competitive 

1C manner with naturally occurring Streptococcus mutans 

antigen I/II, thus preventing or diminishing the adhesion 
of S . mutans to the tooth. Some of the peptides of the 
invention have been shown to inhibit adhesion of S. mutans 
to a tooth surface model (whole human saliva adsorbed to 

15 the wells of polystyrene microtitre plates or 

hydroxyapatite beads) . Thus, these peptides comprise one 
or more adhesion sites and will adhere to a mammalian tooth 
in a competitive manner with naturally occurring SA I/II. 
Therefore, peptides according to the invention that 

20 comprise the adhesion site prevent or diminish the adhesion 
of S. mutans to the tooth. Peptides of the invention that 
comprise one or more adhesion epitopes include SEQ. ID. 
Nos. 1 to 6 and 8 to 10. 

Secondly, peptides according to the invention may 

25 have the ability to stimulate a T-cell response. The 

inventors have shown that residues 803 to 854 and 925 to 
1114 of SA I/II comprise a number of T-cell epitopes that 
are at least partially responsible for the T-cell response 
stimulated by the intact protein. Therefore, peptides 

30 according to the invention that comprise one or more of 
these the T-cell epitopes stimulate a T-cell response 
against S, mutans infection. Peptides of the invention 
that stimulate a T-cell response include those shown in SEQ 
ID Nos. 1 to 11. 
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Thirdly, the peptides of the invention may stimulate 
a B-cell response. The inventors have shown that residues 
303 to 854 and 925 to 1114 of SA I/II comprise a number of 
B-cell epitopes and polypeptides according to the invention 
5 that comprise one or more B-cell epitopes stimulate a B- 
cell response against S. mutans infection. Peptides of the 
invention that comprise one or more B-cell epitopes include 
those shown in SEQ. ID. Nos. 1, 3 to 7 and 10. 

The nucleic acid sequences of the present invention 

1G are preferably DNA, though they may be RNA. It will be 

obvious to those of skill in the art that, in RNA sequences 
according to the invention, the T residues shown in SEQ. 
ID. Nos. 12 to 22 will be replaced by U. Nucleic acid 
sequences of the invention will typically be in isolated or 

15 substantially isolated form. For example up to 80, up to 
90 , up to 95 or up to 100% of the nucleic acid material ir 
a preparation of a nucleic acid of the invention will 
typically be nucleic acid according to the invention. 
Some preferred nucleic acid sequences of the 

20 . invention are those shown in SEQ. ID. Nos. 12 to 22. 
However, the nucleic acid sequences of the present 
invention are not limited to these sequences. Rather, the 
sequences of the invention include sequences that are 
closely related to these sequences and that encode a 

25 polypeptide having at least one of the biological 

properties of naturally occurring SA I/II. These sequences 
may be prepared by altering those of SEQ ID Nos. 12 to 22 
by any conventional method, or isolated from any organism 
or made synthetically. Such alterations, isolations or 

30 syntheses may be performed by any conventional method, for 
example by the methods of Sambrook et §X (Molecular 
Cloning: A Laboratory Manual; 1989) 

For example, the sequences of the invention include 
sequences that are capable of selective hybridisation to 
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those of SEQ. ID. Nos. 12 to 22 or the complementary 
strands thereof and that encode a polypeptide having one or 
more of the properties defined above. Such sequences 
capable of selectively hybridizing to the DNA of SEQ. ID. 
5 Nos. 12 to 22 will generally be at least 70%, preferably at 
least 80 or 90% and more preferably at least 95% homologous 
to the DNA of SEQ. ID. Nos. 12 to 22 over a region of at 
least 10, preferably at least 20/ 30, 40, 50 or more 
contiguous nucleotides. 

10 Such sequences that hybridise to those shown in SEQ. 

ID. Nos. 12 to 22 will typically be of similar size to 
them, though they may be longer or shorter. However, if 
they are longer, they may not simply encode large fragments 
of native SA I/I I amino acid sequence. Thus, sequences 

15 that hybridise to those of SEQ. ID. Nos. 12 to 22 may be 
sequences of up to 1000 bases in length, for example up to 
950 or up to 933 bases in length, 933 bases being the 
length of the DNA sequence encoding the largest 
specifically identified peptide of the invention (SEQ. ID. 

20 No. 21) . Also, sequences that hybridise to those of SEQ. 
ID. Nos. 12 to 22 must do so over at least 50% of their 
length, for example up to 60%, up to 70%, up to 80%, up to 
90%, up to 95%, or up to 99% of their length. 

Such hybridisation may be carried out under any 

25 suitable conditions known in the art (see Sambrook gt jil 
(1989): Molecular Cloning: A Laboratory Manual). For 
example, if high stringency is required, suitable 
conditions include 0.2 x SSC at 60°C. If lower stringency 
is required, suitable conditions include 2 x SSC at 60°C. 

30 Also included within the scope of the invention are 

sequences that differ from those defined above because of 
the degeneracy of the genetic code and encode the same 
polypeptide having one or more of the properties defined 
above, namely the polypeptide of SEQ. ID. Nos. 1 to 11 or 
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a polypeptide related to one of these polypeptides in any 
of the ways defined below. 

Thus, the nucleic acid sequences of the invention 
include sequences which, but for the degeneracy of the 
5 genetic code, would hybridise to those shown in SEQ. ID. 
Nos . 12 to 22 or the complementary strands thereof . 
However, such sequences may not simply encode large 
fragments of native SA I/II amino acid sequences. Thus, 
these sequences may be up to 1000 bases in length, for 

10 example up to 950 or 933 bases in length. Also, their 
sequence must be such that, but of the degeneracy of the 
genetic code, they would hybridise to a sequence as shown 
in SEQ. ID. Nos. 12 to 22 over at least 50% of their 
length, for example, up to 60%, up to 70%, up to 80%, up to 

15 90%, up to 95% or up to 99% of their length. 

Also, the nucleic acid sequences of the invention 
include the complementary strands of the sequences defined 
above, for example the complementary strands of the nucleic 
acid sequences shown in SEQ. ID. Nos. 12 to 22. 

20 Nucleic acid sequences of the invention will 

preferably be at least 30 bases in length, for example up 
to 50, up to 100, up to 200, up to 300, up to 400, up to 
500, up to 600, up to 800 or up to 1000 bases. 

Nucleic acid sequences of the invention may be 

25 extended at either or both of the 5' and 3' ends. Such 

extensions may be of any length. For example, an extension 
may comprise up to 10, up to 20, up to 50, up to 100, up to 
200 or up to 500 or more nucleic acids. A 5' extension may 
have any sequence apart from that which is immediately 5' 

30 to the sequence of the invention (or the native sequence 
from which it is derived) in native SA I/II. A 3' 
extension may have any sequence apart from that which is 3' 
to the sequence of SEQ. ID. No. 13 in native SA I/II. 
Thus, the nucleic acid sequences of the invention may be 
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extended at either or both of the 5' and 3' ends by amy 
non-wild-type sequence. 

The polypeptides of the invention are encoded by the 
DNA sequences described above. Thus, the polypeptides of 
5 the invention are not limited to the polypeptides of SEQ. 
IB. Nos. 1 to 11 although these sequences represent 
preferred polypeptides. Rather, the polypeptides of the 
invention also include polypeptides with sequences closely 
related to those of SEQ. ID. Nos. 1 to 11 that have one or 

10 more of the biological properties of SA I/II. These 

sequences may be prepared by altering those of SEQ ID Nos. 
1 to 11 by any conventional method, or isolated from any 
organism or made synthetically. Such alterations, 
isolations or syntheses may be performed by any 

15 conventional method, for example by the methods of Sambrook 
et al {Molecular Cloning: A Laboratory Manual; 1989). In 
particular, polypeptides related to those of SEQ ID Nos. 1 
to 11 may be prepared by modifying DNA sequences as shown 
in SEQ ID Nos. 12 to 22 expressing them recombinantly . 

20 The polypeptides of the invention may be encoded by 

nucleic acid sequences that have less than 100% sequence 
identity with those of SEQ. ID. Nos. 12 to 22. Thus, 
polypeptides of the invention may include substitutions, 
deletions, or insertions, that distinguish them from SEQ. 

25 ID. Nos. 1 to 11 as long as these do not destroy the 

biological property or properties that the polypeptides 
have in common with SA I/II. 

A substitution, deletion or insertion may suitably 
involve one or more amino acids, typically from one to 

3 0 five, one to ten or one to twenty amino acids, for example, 
a substitution, deletion or insertion of one, two, three, 
four, five, eight, ten, fifteen, or twenty amino acids. 
Typically, a polypeptide of the invention has at least 60%, 
at least 80%, at least 90%, or at least 95% sequence 
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identity to the sequence of any one of SEQ. ID. Nos. 1 to 
11. 

In general, the physicochemical nature of the 
sequence of SEQ. ID. Nos. 1 to 11 should be preserved in a 
5 polypeptide of the invention. Such sequences will 

generally be similar in charge, hydrophobicity and size to 
that of SEQ. ID. Nos. 1 to 11. Examples of substitutions 
than do not greatly affect the physicochemical nature of 
amino acid sequences are those in which an amino acid from 
10 one of the following groups is substituted by a different 
amino acid from the same group: 



H, R and K 

I , L, V and M 
15 A, G, S and T 

D, E, Q and N. 



However, it may be desirable to alter the 
physicochemical nature of the sequence of SEQ. ID. Nos. 1 

20 to 11 in order to increase its therapeutic effectiveness. 
For example, many of the amino acids in the polypeptides of 
the invention are acidic. For example, residues 975 to 
1044 (SEQ. ID. No. 8) as a whole are of an acidic nature. 
This acidity is believed to facilitate binding to a 

25 mammalian tooth. Thus, it may be desirable to increase the 
n: Iditry of polypeptides of the invention by adding acidic 
residues or by substituting acidic residues for non-acidic 
ones. Acidic residues include aspartic acid and glutamic 
acid. 

30 Where polypeptides of the invention are synthesised 

chemically, D-amino acids (which do not occur in nature) 
may be incorporated into the amino acid sequence at sites 
where they do not affect the polypeptides biological 
properties. This reduces the polypeptides' susceptibility 
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to proteolysis by the recipient's proteases. 

The nucleic acid sequences encoding the polypeptides 
of the invention may be extended at one or both ends by any 
non -wild- type sequence. 
5 Thus, the polypeptides of the invention may be 

extended at either or both of the C- and N- termini by an 
amino acid sequence of any length. For example, an 
extension may comprise up to 5, up to 10, up to 20, up to 
50, or up to 100 or 200 or more amino acids. An N-terminal 

10 extension may have any sequence apart from that which is N- 
terminal to the sequence of SEQ. ID. No. the invention (or 
the native sequence from which it is derived) in native SA 
I/II. A C-terminal extension may have any sequence apart 
from that which is C-terminal to the sequence of the 

15 invention (or the native sequence from which it is derived) 
in native SA I/II. Thus, the polypeptides of the inventic»n 
may be extended at either or both of the C- and N- termini 
by any non-wild-type sequence. 

The polypeptides of the invention may be attached to 

20 other polypeptides or proteins that enhance their antigenic 
properties. Thus, polypeptides of the invention may be 
attached to one or more other antigenic polypeptides. 
These additional antigenic polypeptides may be derived from 
S . mutans or from another organism. Possible additional 

25 antigenic polypeptides include heterologous T-cell epitopes 
derived from other S. mutans proteins or from species other 
than S. mutans . Heterologous B-cell epitopes may also be 
used. Such heterologous T-cell and or B-cell epitopes may 
be of any length and epitopes of up to 5, up to 10 or up to 

30 20 amino acids in length are particularly preferred. These 
additional antigenic polypeptides may be attached to the 
polypeptides of the invention chemically. Alternatively, 
one or more additional antigenic sequences may comprise an 
extension to a polypeptide of the invention. 
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A polypeptide of the invention may be subjected to 
one or more chemical modifications, such as glycosylation, 
sulphation, COOH-amidation or acylation. In particular, 
polypeptides that are acetylated at the N- terminus are 
5 preferred, as are polypeptides having C- terminal amide 
groups. Preferred polypeptides may have one or more of 
these modifications. For example, particularly preferred 
peptides may have a C-terminal amide group and N- terminal 
acetylation. 

10 A polypeptide of the invention may form part of a 

larger polypeptide comprising multiple copies of the 
sequence of one or more of SEQ. ID. Nos. 1 to 11 or a 
sequences related to them in any of the ways defined 
herein. 

15 Polypeptides of the invention typically comprise at 

least 15 amino acids, for example 15 to 20, 20 to 50, 50 to 
100 or 100 to 200 or 200 to 300 amino acids. Preferred 
polypeptides include those shown in SEQ. ID. Nos. 1 to 11. 
Polypeptides according to the invention may be 

20 purified or substantially purified. Such a polypeptide in 
substantially purified form will generally form part of a 
preparation in which more than 90%, for example up to 95%, 
up to 98% or up to 99% of the peptide material in the 
preparation is that of a polypeptide or polypeptides 

25 according to the invention. 

The nucleic acid sequences and polypeptides of the 
invention were originally derived from S. mutans . However, 
nucleic acid sequences and/or polypeptides of the invention 
may also be obtained from other organisms, typically 

30 bacteria, especially other streptococci. They may be 

obtained either by conventional cloning techniques or by 
probing genomic or cDNA libraries with nucleic acid 
sequences according to the invention. This can toe done by 
any conventional method, such as the methods of Sambrook et 
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al (Molecular Cloning: A Laboratory Manual; 1989) . 

A nucleic acid sequence according to the invention 
may be included within a vector, suitably a replicable 
vector, for instance a replicable expression vector. 
5 A replicable expression vector comprises an origin of 

replication so that the vector can be replicated in a host 
cell such as a bacterial host cell. A suitable vector will 
also typically comprise the following elements, usually in 
a 5' to 3' arrangement: a promoter for directing 
10 expression of the nucleic acid sequence and optionally a 
regulator of the promoter, a translational start codon and 
a nucleic acid sequence according to the invention encoding 
a polypeptide having one or more of the biological 
properties of SA I/II. A non-replicable vector lacks a 
15 suitable origin at replication whilst a non- express ion 
vector lacks an effective promoter. 

The vector may also contain one or more selectable 
marker genes, for example an ampicillin resistance gene for 
the identification of bacterial transf ormants . One 
20 particular preferred marker gene is the kanamycin 

resistance gene. Optionally, the vector may also comprise 
an enhancer for the promoter. If it is desired to express 
the nucleic acid sequence of the invention in a eucaryotic 
cell, the vector may also comprise a polyadenylation signal 
operably linked 3' to the nucleic acid encoding the 
functional protein. The vector may also comprise a 
transcriptional terminator 3' to the sequence encoding the 
polypeptide of the invention. 

The vector may also comprise one or more non- coding 
sequences 3' to the sequence encoding the polypeptide of 
the invention. These may be from S. mutans (the organism 
from which the sequences of the invention are derived) or 
the host organism which is to be transformed with the 
vector or from another organism. 
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In an expression vector, the nucleic acid sequence of 
the invention is operably linked to a promoter capable of 
expressing the sequence. "Operably linked" refers to a 
juxtaposition wherein the promoter and the nucleic acid 
5 sequence encoding the polypeptide of the invention are in a 
relationship permitting the coding sequence to be expressed 
under the control of the promoter. Thus, there may be 
elements such as 5' non-coding sequence between the 
promoter and coding sequence. These elements may be native 
1C either to S . mutans or to the organism from which the 

promoter sequence is derived or to neither organism . Such 
sequences can be included in the vector if they enhance or 
do not impair the correct control of the coding sequence by 
the promoter. 

15 The vector may be of any type. The vector may be in 

linear or circular form. For example, the vector may be a 
plasmid vector. Those of skill in the art will be able to 
prepare suitable vectors comprising nucleic acid sequences 
encoding polypeptides of the invention starting with widely 

20 available vectors which will be modified by genetic 

engineering techniques such as those described by Sambrook 
et al (Molecular Cloning: A Laboratory Manual; 1989). 
Preferred starting vectors include plasmids that confer 
kanamycin resistance and direct expression of the 

25 polypeptide of the invention via a tac promoter. 

In an expression vector, any promoter capable of 
directing expression of a sequence of the invention in a 
host cell may be operably linked to the nucleic acid 
sequence of the invention. Suitable promoters include the 

30 tac promoter. 

Such vectors may be used to transfect or transform a 
host cell. Depending on the type of vector, they may be 
used as cloning vectors to amplify DMA sequences according 
to the invention or to express this DNA in a host cell. 
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A further embodiment of the invention provides host 
cells harbouring vectors of the invention, i.e. cells 
transformed or transfected with vectors for the replication 
and/or expression of nucleic acid sequences according to 
5 the invention, including the sequences shown in SEQ. ID. 
Nos. 12 to 22. The cells will be chosen to be compatible 
with the vector and may for example be bacterial cells. 
Transformed or transfected bacterial cells, for example 
E .coli cells, will be particularly useful for amplifying 

10 nucleic acid sequences of the invention as well as for 
expressing them as polypeptides. 

The cells may be transformed or transfected by any 
suitable method, such as the methods described by Sambrook 
et (Molecular cloning: A Laboratory Manual; 1989) . For 

15 example, vectors comprising nucleic acid sequences 

according to the invention may be packaged into infectious 
viral particles, such as retroviral particles. The 
constructs may also be introduced, for example, by 
electroporation, calcium phosphate precipitation, biolistic 

20 methods or by contacting naked nucleic acid vectors with 
the cells in solution. 

In the said nucleic acid vectors with which the host 
cells are transformed or transfected, the nucleic may be 
DNA or RNA, preferably DNA. 

25 The vectors with which the host cells are transformed 

or transfected may be of any suitable type. The vectors 
may be able to effect integration of nucleic acid sequences 
of the invention into the host cell genome or they may 
remain free in the cytoplasm. For example, the vector used 

30 for transformation may be an expression vector as defined 
herein. 

The present invention also provides a process of 
producing polypeptides according to the invention. Such a 
process will typically comprise transforming or 
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transfecting host cells with vectors comprising nucleic 
acid sequences according to the invention and expressing 
the nucleic acid sequence in these cells. In this case, 
the nucleic acid sequence will be operably linked to a 
!5 promoter capable of directing its expression in the host 
cell. Desirably, such a promoter will be a "strong" 
promoter capable of achieving high levels of expression in 
the host cell. It may be desirable to overexpress the 
polypeptide according to the invention in the host cell. 

10 Suitable host cells for this purpose include yeast cells 
and bacterial cells, for example E. coli cells, a 
particularly preferred E. coli strain being E. coli K12 
strain BL 21. However, other expression systems can also 
be used, for example baculovirus systems in which the 

\ 3 vector is a baculovirus having in its genome nucleic acid 
encoding a polypeptide of the invention and expression 
occurs when the baculovirus is allowed to infect insect 
cells . 

The thus produced polypeptide of the invention may be 
20 recovered by any suitable method known in the art. 

Optionally, the thus recovered polypeptide may be purified 
by any suitable method, for example a method according to 
Sambrook et al (Molecular Cloning: A Laboratory Manual) . 
The polypeptides of the invention may also be 
25 synthesised chemically using standard techniques of peptide 
synthesis. For shorter polypeptides, chemical synthesis 
may be preferable to recombinant expression. In 
particular, peptides of up to 20 or up to 40 amino acid 
residues in length may desirably be synthesised chemically. 
30 The nucleic acid sequences of the invention may be 

used to prepare probes and primers. These will be useful, 
for example, in the isolation of genes having sequences 
similar to that of SEQ. ID. No. 24. Such probes and 
primers may be of any suitable length, desirably from 10 to 
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100, for example from 10 to 20, 20 to 50 or 50 to 100 bases 
in length. 

The present invention also provides antibodies to the 
polypeptides of the invention. These antibodies may be 
5 monoclonal or polyclonal. For the purposes of this 

invention, the term "antibody", includes fragments of whole 
antibodies which retain their binding activity for a target 
antigen. Such fragments include Fv, F(ab') and F(ab') 2 
fragments, as well as single chain antibodies. 

10 The antibodies may be produced by any method known in 

the art, such as the methods of Sambrook £l (Molecular 
Cloning: A Laboratory Manual; 1989) . For example, they may 
be prepared by conventional hybridoma techniques or, in the 
case of modified antibodies or fragments, by recombinant 

15 DNA technology, for example by the expression in a suitable 
host vector of a DNA construct encoding the modified 
antibody or fragment operably linked to a promoter. 
Suitable host cells include bacterial (for example E. 
coli ) , yeast, insect and mammalian cells. Polyclonal 

20 antibodies may also be prepared by conventional means which 
comprise inoculating a host animal, for example a rat or a 
rabbit, with a peptide of the invention and recovering 
immune serum. 

The present invention also provides pharmaceutical 
25 compositions comprising polypeptides of the invention. 

Three types of pharmaceutical compositions are particularly 
preferred. Firstly, compositions comprising polypeptides 
of the invention that include T-cell and/or B-cell epitopes 
may be used as vaccines against dental caries. Secondly, 
30 compositions comprising polypeptides of the invention that 
comprise adhesion sites will prevent or diminish adhesion 
of S. mutans to the tooth and can be used in the treatment 
of pre-existing cases of dental caries. Thirdly, 
compositions comprising polypeptides of the invention that 
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include both one or more antigenic (T-cell or B-cell) 
epitopes and one or more adhesion epitopes can be used to 
effect vaccination against dental caries at the same time 
as caring pre-existing cases of the disease. A similar 
5 effect can be achieved by including in a composition one or 
more peptides comprising one or more antigenic epitopes and 
one or more peptides comprising one or more adhesion sites. 

A range of mammalian species can be vaccinated 
against dental caries using the polypeptides of the 
10 invention. Vaccination of humans is particularly 
desirable. 

The compositions of the invention may be administered 
to mammals including humans by amy route appropriate. 
Suitable routes include topical application in the mouth, 

15 oral delivery by means of tablets or capsule and parenteral 
delivery, including subcutaneous, intramuscular, 
intravenous and intradermal delivery. Preferred routes of 
administration are topical application in the mouth and 
injection, typically subcutaneous or intramuscular 

20 injection, with a view to effecting systemic immunisation. 

As previously indicated, polypeptides according to 
the invention may also be mixed with other antigens of 
different immunogenicity. 

The compositions of the invention may be administered 

25 to the subject alone or in a liposome or associated with 
other delivery molecules. The effective dosage depends on 
many factors, such as whether a delivery molecule is used, 
the route of delivery and the size of the mammal being 
vaccinated. Typical doses are from 0.1 to 100 mg of the 

30 polypeptide of the invention per dose, for example 0.1 to 
lmg, and 1 to 5mg, 5 to lOmg and 10 to lOOmg per dose. 
Doses of from 1 to 5 mg are preferred. 

Dosage schedules will vary according to, for example, 
the route of administration, the species of the recipient 
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and the condition of the recipient. However, single doses 
and multiple doses spread over periods of days, weeks or 
months are envisaged. A regime for administering a vaccine 
composition of the invention to young human patients will 
5 conveniently be :6 months, 2 years, 5 years and 10 years, 
with the initial dose being accompanied by adjuvant and the 
subsequent doses being about M to X the level of 
polypeptide in the initial dose. The frequency of 
administration can, however, be determined by monitoring 

1C the antibody levels in the patient. 

Where the peptides of the invention are to be applied 
topically in the mouth, one preferred dosage regime is to 
apply one or more polypeptides of the invention on two or 
more occasions, for example 2 to 10 occasions over a period 

15 of a few weeks, for example one to six weeks. A 

particularly preferred regime of this type involves six 
applications of a polypeptide of the invention over a 
period of three weeks. 

Typical doses for each topical application are in the 

20 range of 0.1 to lOOmg for example 0.1 to lmg, 1 to lOmg and 
10 to lOOmg. Doses of from 1 to 5mg for each application 
are preferred. 

While it is possible for polypeptides of the 
invention to be administered alone it is preferable to 

25 present them as pharmaceutical formulations. The 

formulations of the present invention comprise at least one 
active ingredient, a polypeptide of the invention, together 
with one or more acceptable carriers thereof and optionally 
other therapeutic ingredients. The carrier or carriers 

3 0 must be "acceptable" in the sense of being compatible with 
the other ingredients of the formulation and not 
deleterious to the recipients thereof, for example, 
liposomes. 

Formulations suitable for parenteral administration 
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include aqueous and non-aqueous sterile injection solutions 
which may contain ant i- oxidants, buffers, bacteriostatis, 
bactericidal antibiotics and solutes which render the 
formulation isotonic with the blood of the intended 
5 recipient; and aqueous and non-aqueous sterile suspensions 
which may include suspending agents and thickening agents, 
and liposomes or other microparticulate systems which are 
designed to target the compound to blood components or one 
or more organs. 

10 In particular, the polypeptides of the invention may 

be coupled to lipids or carbohydrates. This increases 
their ability to adhere to teeth, either by prolonging the 
duration of the adhesion or increasing its affinity, or 
both This is particularly desirable for shorter 

15 polypeptides of the invention, which comprise up to around 
40 amino acid residues. 

Of the possible formulations, sterile pyrogen- free 
aqueous and non-aqueous solutions are preferred. Also 
preferred are formulations in which the polypeptides of the 

20 invention are contained in liposomes. Injection solutions 
and suspensions may be prepared extemporaneously from 
sterile powders, granules and tablets of the kind 
previously described. 

Oral methods of administration may produce an effect 

25 systemically or locally in the mouth. Orally active 
preparations can be formulated in any suitable carrier, 
such as a gel, toothpaste, mouthwash or chewing gum. 

It should be understood that in addition to the 
ingredients particularly mentioned above the formulations 

i'i of this invention may include other agents conventional in 
the art having regard to the type of formulation in 
question. 

Accordingly, the present invention provides a method 
of vaccinating a mammalian host against dental caries or 
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treating dental caries, which method comprises 
administering to the host an effective amount of a 
pharmaceutical composition as described above, for example 
a vaccine composition, 
5 Antibodies, including monoclonal antibodies, can be 

formulated for passive immunisation as indicated above for 
the formulation of including polypeptides of the invention. 
Preferred formulations for passive immunisation include 
solid or liquid formulations such as gels, toothpastes, 

10 mouth- washes or chewing gum. 

A further aspect of the present invention is a naked 
nucleic acid vaccine. In this embodiment, the vaccine 
composition comprises a nucleic acid, typically an isolated 
nucleic acid, preferably DNA, rather than a polypeptide. 

15 The nucleic acid is injected in to a mammalian host and 
expressed in vivo, generating a polypeptide of the 
invention. This stimulates a T-cell response, which leads 
to protective immunity against dental caries in the same 
way as direct vaccination with a polypeptide of the 

20 invention. 

Naked nucleic acid vaccination can be carried out 
with any nucleic acid according to the invention as long as 
it encodes a polypeptide that stimulates a T-cell and or B- 
cell response. Preferred nucleic acids are those shown in 

25 SEQ. ID Nos. 1 to 11 . These will typically be included 

within an expression vectors as defined above. In such an 
expression vector, the nucleic acid according to the 
invention will typically be operably linked to a promoter 
capable of directing its expression in a mammalian host 

30 cell For example, promoters from viral genes that are 
expressed in the mammalian cells such as the 
cytomegalovirus (CMV) immediate early gene promoter are 
suitable. Also suitable are promoters from mammalian genes 
that are expressed in many or all mammalian cell types such 
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as the promoters of "housekeeping" genes. One such 
promoter is the p-hydroxymethyl-CoA-reductase (HMG) promoter 
(Gautier et ^1 (1989): Nucleic Acids Research; 17,8839). 

For naked nucleic acid vaccination, it is preferred 
5 that the nucleic acid sequence according to the invention 
is incorporated into a plasmid vector, since it has been 
found that covalent closed circle (CCC) plasmid DNA can be 
taken up directly by muscle cells and expressed without 
being integrated into the cells' genomic DNA (Ascadi et al 

10 (1991) : The New Biologist; 3, 71-81) . Naked nucleic acid 
vaccine may be prepared as any of the types of f ormulatior. 
mentioned above in respect of conventional polypeptide- 
based vaccines. However, formulations suitable for 
parenteral injection, especially intramuscular injection, 

15 are preferred. Naked nucleic acid vaccines may be 

delivered in any of the ways mentioned above in respect of 
conventional polypeptide -based vaccines but intramuscular 
injection is preferred. 

Accordingly, the present invention provides a vaccine 

20 composition comprising a nucleic acid sequence or vector as 
described above and an acceptable carrier. 

The following examples illustrate the invention. 

EXAMPLES 

25 MATERIALS AND METHODS 

Materials. Fmoc amino acids, benzotriazole-l-yl-oxy-tris- 
pyrrolidino-phosphonium hexaf lurophosphate (PyBOP) and Rink 
Amide MBHA resin were purchased from Calbiochem-Novabiochem 
(UK) Ltd., (Nottingham, UK). Dimethylformamide, 

3 0 trif luoroacetic acid, diethyl ether, dichloromethane and 
piperidene were purchased from Romil Chemicals Ltd 
(Loughborough, UK) . Di-isopropylethylamine was from 
Aldrich Chemical Co. (Dorset, UK) . Oligonucleotides were 
purchased from Oswel DNA Service (University of Edinburgh, 
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Edinburgh, UK) . 

Bacteria and growth conditions 

S. mutans Guy's strain (serotype c) were grown in 10L basa.l 
5 medium supplemented as described previously (Russel al 
(1978): Arch. Oral Biol., 2317; Russel et al (1980): 
Infect, Innnun. 61, 5490) at 37°C for 72h for SA I/II 
preparation. For the adhesion assay, S . mutans were grown 
in Todd-Hewitt broth (Difco Laboratories, Detroit, Mich J . 

1C Escherichia coli BL21 (DE3) (Novagen Inc., Madison, Wis.) 
harbouring pET15b were grown at 37°C in Luria-Bertani broth 
supplemented with carbenicillin (50/xg/ml) and recombinant 
protein expression was induced with isoppropyl-6-D- 
thiogalactopryanoside (ImM) . 

15 Antigens. SA I/II was prepared from S. mutans (serotype c, 
Guy's strain) as described by Russel et al (1980: Infect. 
Immun. 28, 486) . Using the procedure of Munro et al (1993: 
Infect. Immun. 61, 4590), the portion of the gene encoding 
residues 984-1161 was amplified by using the 

20 oligonucleotide primers: (5') 

ATACATATGCCAACTGTTCATTTCCATTACTTT (SEQ. ID. No. 25) and 
(3') GCCATTGTCGACTCATTCATTTTTATTAACCTTAGT (SEQ. ID. No. 
26), cloned into pET15b (modified by the addition of a Sal 
I site) and expressed in E. coli . 

25 

Synthetic peptides. Peptide amides (20 mers overlapping by 
10 residues) were synthesised on Rink amide MBHA resin in 
sealed porous polypropylene bags by the manual simultaneous 
multiple peptide synthesis procedure (Houghten (1985) PNAS 
30 892, 5131) using Fmoc chemistry. PyBOP was used as 

coupling agent and Fmoc amino acids were activated in situ 
by addition of diisopropylethylamine . Following 20 cycles 
of synthesis, resin was washed with dimethyl formamide 
followed by dichloromethane and peptides were cleaved by 
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incubation in trif luoroacetic acid- ethanedithiol-anisole- 
phenol-H 2 0 (82.5:2.5:5:5:5; v/v/v/w/v) for 2h at room 
temperature. Peptides were precipitated by the addition of 
5 volumes ether, recovered by centrifugation and washed 
5 three times with ether. Finally, peptides were dissolved 
in water and lyophilised. The scale of synthesis was 
50jimol. Aliquots of each peptide were hydrolysed in 6M HCl 
at 110*C for 24h and compositions were determined using the 
Beckman 121MB automated analyser (Beckman Instruments Ltd, 
10 Bucks, UK) . In each case the composition matched that 
predicted. 

Antibodies. MAbs, L243 (anti-MHC class II) and W6/32 
(anti-MHC class I) were produced from cultures of 

15 hybridomas obtained from the American Type Culture 

Collection (Rockville, Md, USA) . ID4 an isotype (IgG2a) 
matched control of irrelevant specificity was provided by 
Dr. P. Shepherd (Department of Immunology, UMDS, Guys 
Hospital, London, UK) . Rabbit anti-SA I/II antiserum was 

20 prepared as described previously (Russel et al (1980) . 
Infect. Immun. 28, 486). 

Lymphoprolif erative assay. Defibrinated blood from 
volunteers was separated on a Ficoll gradient . Sera was 

25 used for antibody assays (see below) while peripheral blood 
mononuclear cells (PBMCs) were washed and resuspended in 
RPMI 1640 (Sigma Chemical Co., St. Louis, Mo, USA) 
supplemented with 2mM L-glutamine, penicillin (100 IU/ml) , 
streptomycin sulphate (100 fig/ml) and 10% heat -inactivated 

30 autologous serum. PBMCs (10 5 cells/well) were cultured in 
96-well round-bottomed plates (Costar, Cambridge, Mo, USA) 
in a total volume of 200 jil. Three replicates of each 
culture were incubated with three concentrations (1, 10 and 
40 ng/ml) of SA I/II, recombinant fragments, non- 
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recombinant control or synthetic peptides. Incubation was 
at 37"C in a humidified atmosphere with 5% C0 2 for 6 days. 
Each culture received 0.2 //Ci (7.4kBq) of [ 3 H] -thymidine 
(Amersham International, Bucks, UK) 6h before harvesting. 
> Cultures were harvested onto glass fibre filters using a 
Dynatech (Chantilly, VA, USA) Minimal Cell harvester and 
[ 3 H] thymidine incorporation was measured using the LKB 
liquid scintillation counter (Bromma, Sweden) . 
Proliferation was expressed as stimulation index which is 
10 mean counts per minute (cpm) of antigen- stimulated, divided 
by cpm of antigen-free cultures. Concanavalin A (10 jig/ml) 
(Sigma Chemical Co., St. Louis, Mo, USA) was used with 
every culture as a positive control but the results are not 
presented. 

15 MHC dependency of proliferative responses to SA I/II 

was determined by culturing cells with antigen (10 /xg/ml) 
as above in the presence of MAbs L235, W6/32 or ID4 at 1, 
10 and 20 /xg/ml. Cultures were incubated with [ 3 H] - 
thymidine, harvested and [ 3 H] -thymidine uptake was 

20 determined as described above. 

ELISA for serum antibodies. Antibody recognition of 
synthetic peptides was determined by ELISA. Peptides (10 
/zg/ml) in phosphate buffered saline (PBS) were adsorbed to 

25 wells of polystyrene microtitre plates (Dynatech) for 2h at 
room temperature. Plates were washed and wells were 
treated with 1.5% (w/v) bovine serum albumen (BSA) for lh 
at room temperature to block unbound sites. After washing, 
bound peptides were incubated with serially diluted sera in 

30 duplicate. Bound IgG antibodies were determined by 

incubation with alkaline phosphate conjugated -goat anti- 
human Ig (Sigma Chemical Co.) and subsequent reaction with 
paranitrophenyl phosphate (Sigma Chemical Co.). Plates 
*ere read at 405nm using the microplate reader model 450 
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(Bio-Rad) . After initial screening, the assay was repeated 
at least 3 times with each serum using a restricted set of 
peptides. SA I/II (2 jzg/ml) was included in each assay as 
war? an irrelevant peptide (HQAAMQIIRDIINF.EAADWD (SEQ. ID, 
5 No. 27) derived from the sequence of SIV p27. Results are 
e>cpressed as the highest dilution giving an absorbance 
>0.2. 

Western blotting. Serum antibody responses were also 

10 assayed by Western blotting using SA I/II, the recombinant 
polypeptides and a control fraction from £. coli BL21 
harbouring non-recombinant pET15b. Purified antigens were 
separated by sodium dodecyl sulphate-polyacrylamide gel 
electrophoresis (SDS-PAGE) with gels of 10% acrylamide, by 

15 using a mini -gel system (Hoeffer Scientific Instruments, 
San Francisco, Co, USA) . Proteins were transferred to 
nitrocellulose with a semi-dry blotter (Sartorius A.G., 
Gottingen, Germany) . Nitrocellulose strips were blocked 
with 5% (wt/vol) nonfat milk powder 2.5% (wt/vol) BSA in 

20 Tris-HCl-buf fered saline (pH 8.0) containing 0.05% (wt/vol) 
Tween 20. Strips were subsequently incubated with human 
sera (1 in 20 dilution) or rabbit anti-SA I/II antiserum 
(10~ 4 dilution) and bound antibody was visualised by using 
alkaline phosphatase -conjugated secondary antibody with 5- 

25 bromo-4-chloro-3- indolylphosphate and nitroblue 

c£t^azolium (Sigma Chemical Co.) as substrates. Each sera 
was assayed three times and responses were considered as 
positive if bands were visible in at least two assays. 

30 Bacterial adherence assay. SA I/II mediated adherence of 
S. mutans (Guy's strain) to saliva was assayed by 
determining binding of [ 3 H] -thymidine labelled bacteria to 
saliva adsorbed to microtitre wells. Freshly collected 
human saliva from a single donor was clarified by 
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cent rifugat ion for 10 min at 30009, heat -inactivated at 
60°C for 30 min and finally clarified, by cent rifugat ion at 
17,000g for 20 rain. Treated saliva was diluted with an 
equal volume of PBS and adsorbed to the wells of a 
5 polystyrene 96 -well flat -bottomed microtitre plate (Immulon 
4; Dynatech) for 2h at room temperature. After coating, 
wells were washed three times with PBS and unbound sites 
were blocked by incubation with 1.5% (wt/vol) BSA in PBS 
for lh at room temperature. Plates were then washed three 

10 times with 50 mM KCl-lmM CaCl a -38 mM MgCl 2 -l mM KH 2 PO«-1.2 
mM K 2 P0 4 (pH 7.2; adherence buffer) . S. mutans cells from 
an overnight culture in Todd-Hewitt broth were used to 
inoculate (1/10 volume) a further culture in Todd-Hewitt 
broth containing 100/iCi (3.7 MBq) [ 3 H] -thymidine (Amersham 

15 International pic) per ml. Cells were harvested in late 
log phase (O.D. 700nm approximately 0.4) pelleted by 
centrifugation at lOOOg for 10 min and washed three time** 
in adherence buffer. The final suspension was vortexed 
with 0.5 volume glass beads to break up chains of cocci 

20 which was monitored microscopically (Munro et a! (1993) : 

Infect. Immun. 61 , 4590) . Cells were resuspended to 5x10* 
c.p.m. per 50 jil and BSA was added to 1.5% (wt/vol). 
Specific activity of the washed S. mutans cells was 
estimated to be 1.3x10 ~ 3 c.p.m. per cell (Munro e£ 

25 (1985): Infect. Immun. 61, 4590). In competitive 

inhibition of adherence, the various synthetic peptides 
were added to the wells (at final concentrations 62.5- 
500jiM) in 50 jil adherence buffer containing 1.5% (wt/vol) 
BSA together with 50 fil radiolabelled S. mutans suspension. 

30 Microtitre plates were incubated at 37°C for 2h with gentle 
shaking and subsequently were washed ten times with 
adherence buffer. Bound S. mutans cells were eluted with 
1% (wt/vol) SDS and transferred to glass fibre filters by 
using the Micromate 196 cell harvester (Canberra Packard, 
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Berks, UK) . Filters were counted using the Matrix 96 
direct beta counter (Canberra Packard) . Background binding 
was determined on wells to which no saliva was adsorbed. 
The percentage of binding of S. mutans to saliva was 
5 calculated by the formula [(test c.p.m.) - (control c.p.m.) 
/ total c.p.m.] xlOO. Percent inhibition of adherence was 
calculated as [ (percent adherence without inhibitor- 
percent adherence with inhibitor) / percent adherence 
without inhibitor] xlOO, For proteins, determinations of 
10 streptococcal adhesion were made in triplicate or 

quadruplicate at each protein concentration while for 
peptides, duplicate determinations were made. In each caue 
the assay was performed at least three times. 

15 Statistics 

The student's t test was used to analyse results. 

Example 1 

Preparation of a panel of overlapping synthetic peptides 
20 and analysis of their properties. 

T cell epitope mapping 

A panel of 32 overlapping synthetic peptides, 
spanning residues 803-1174 of SA I/II, was prepared, as 

25 described above (See Figure 1) . Proliferative responses of 
PBMCs from 30 subjects were determined by stimulation with 
peptides (see Figure 2) . All subjects responded to at 
least one peptide with a band range of 1-6 peptides, and a 
mean of 4.4 peptides. On the basis of frequency of 

30 response to each peptide (SI>3.0 c.p.m. >500) 3 

immunodominant epitopes were identified; peptides 803-822, 
975-994 and 985-1004, each yielding frequencies >50% (Fig. 
1) . Since most (13/15) subjects who responded to peptide 
975-994 also responded to peptide 985-1004, it is probable 
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that a single T-cell epitope is present within residues 
975-1004. Minor T cell epitopes were also identified within 
peptides 1005-1024, 1015-1034, 1085-1104 and 1115-1134 with 
frequencies >20% and some of the adjacent peptides may 
5 represent single T cell epitopes. 

MHC restriction of the lymphoprolif erative responses (See 
Figure 3 and Table 2) 

HLA restriction of the T cell response was first 
1C studied by dose -dependant inhibition with MAb to HLA class 

I and II antigen (Fig. 2) . The lymphoprolif erative 
response was inhibited by 50% with 1/xg of MAb to HLA class 

II (L243) and 10/ig of the MAb inhibited 100% of the 
responses (from SI 10.0+3.2 to SI 1.5+0.4). Neither MAb to 

15 HLA class I (W6/32) nor the isotype control induced any 
inhibition of the lymphoprolif erative response. 

The HLA-DR of 17 subjects were determined and 6 of 
these were homozygous. The responses of the immunodominant 
and minor epitopes were then studied in the 6 DR homologous 

20 subjects (Table 2) . Only peptide 975-994 appeared to be 
restricted by HLA-DR1. The other 6 peptides stimulated 
lymphocytes from HLA-DR1 , 2 (except AA 1085-1104) and DR6 
(except AA 803-822) . DR5 was restricted by peptide 803- 
922, though the latter stimulated lymphocytes with DR1, 2 

25 and 3 antigens. Lymphocytes with DR3 or 4 antigen 

responded to 3 or 4 peptides. The results suggest that 
except for peptide 975-994, the remaining € peptides appear 
to be promiscuous as they stimulated lymphocytes with 3 to 
5 HLA-DR antigens. 

30 
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B cell epitope mapping (see Figure 4) 

Recognition of the recombinant fragments was assessed 
by Western blotting. Representative blots obtained with 
sera from 3 individuals are shown in Fig. 3 together with a 
5 positive control using rabbit anti-SA I/II antiserum. In 
panel a, SA I/II , and 984-1161 were recognised strongly. 
Rabbit anti-SA I/II antiserum used as a positive control 
(panel d) recognised recombinant 984-1161. The recombinant 
polypeptide corresponding to residues 984-1161 was also 
10 analysed. SA I/II was recognised by all subjects. B cell 
epitopes were mapped by ELISA using the panel of synthetic 
peptides. The panel of peptides was screened with sera 
from 22 individuals and 8 peptides which were recognised by 
more than one individual, together with one peptide which 
15 was not recognised, were selected for further analyses 

(Fig. 5) . SA I/II was recognised by all subjects with mean 
log 2 titre of 7.6+1,2. Titres against peptides were lower, 
with only that against peptide 824-843 (mean log 2 titre 
4.7+1.1) being significantly greater than the titre against 
20 the control SIV p27 peptide (t=7.28 p<0.01). The 
proportion of significant titres (>mean +2 standard 
derivations) was also calculated (Fig. 5) and only peptide 
824-843 showed high frequency (18/22) . Indeed, an 
immunodominant B cell epitope is present within peptide 
824-843, possibly shared with the overlapping peptide 834- 
353, while peptides 925-944, 1035-1054 and 1085-1104 
constitute minor B cell epitopes. Despite the high 
frequency of responses to the recombinant polypeptide 984- 
1161 described above) , a very low frequency of responses 
was observed to peptides within this region. 

Saliva samples from the subjects were cultured to 
determine levels of S. mutanft . In 66% of individuals S. 
mutans was detected (range 10 3 -10 5 colony forming 
units/ml) . There was no correlation between S. mutans 
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levels and recognition of particular epitopes or titre 
against SA I/I I. 

Adhesion epitope mapping 

5 Adherence of S. mutans to saliva-coated microtitre 

wells {a model of the tooth surface) was determined with 
[ 3 H] -thymidine labelled S. mutans . The proportion of 
adhering bacteria was in the range 1-5%. In the absence cf 
saliva, the proportion of adhering bacteria was <0.1%. 

10 In a series of competitive inhibition assays, the 

panel of synthetic peptides was assayed for inhibition of 
adhesion of S. mutans to saliva-coated microtitre wells. 
Peptides 1005-1024, 1025-1044 and 1085-1104 consistently 
inhibited adhesion with maximal inhibition ,> 90% at 

15 concentrations of 500 fM (Fig. €) . Adjacent peptides 1015- 
1034 and 1095 -3114 showed more variable and lower 
inhibition, and may be part of the adhesion epitopes. 

Example 2 

20 Construction of an expression vector and expression of a 
recombinant polypeptide of the invention (SEQ. ID. No. 8) . 

Using the oligonucleotide primers TAT CAT ATG CAA GAT CTT 
CCA ACA CCT CCA TCT ATA (5') (SEQ. ID. NO. 29) and GTC GAC 
TCA TAC CAA GAC AAA GGA AGT TGT (3') (SEQ. ID. No. 30) the 

25 portion of the SA I/II gene encoding residues 975-1044 

(SEQ. ID. No. 8) was amplified by polymerase chain reaction. 
The amplified gene fragment (with introduced Nde I and Sal 
I restriction enzyme sites) was cloned using the Ta cloning 
system and was subcloned into the plasmid pETlSb. The 

30 recombinant polypeptide was expressed in p. cqH BL21 
(DE3) . 

Example 3 

Stimulation of an in vitro T-cell response by the 
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reco mb inant polypeptide (SEQ. ID. No. 8) . 

Peripheral blood lymphocytes from human volunteers were 
prepared as described above. Cells were incubated with 
purified recombinant polypeptide 975-1044 at concentrations 
5 of 40, 10 and 1/ig/ml. Cells were also incuabted with a 
protein fraction prepared in the same way from E, coli 
harbouring non- recombinant plasmid. Proliferative 
responses of 17 subjects were determined. Mean stimulation 
index (+sem) was 11.6+2.3 compared with 2.4+0.3 for the 
10 control. The frequency of subjects responding (i.e. those 
with stimulation index > control + 2SD) was 15/17. 

Example 4 

Immunisation of mice with the recombinant polypeptide (SEQ. 
15 ID. NO. 8) (See Figure 7) 

i) Groups of mice (3-4 per group) were immunised with 
975-1044 (SEQ. ID. No. 8) by two routes: 

a) intraperitoneal ly with 50pg polypeptide in incomplete 
Freund's adjuvant with a boost after 4 weeks (also 50/ig in 

20 incomplete Freund's adjuvant and intraperitoneally) . 

b) subcutaneously. A single immunisation with 50/xg 
polypeptide in incomplete Freund's adjuvant. 

ii) Draining lymph nodes were removed 10 to 14 days after 
immunisation, pooled and homogenised to give a single cell 

25 suspension in RPMI 1640 culture medium supplemented with 
2mM glutaraine, ImM pyruvate, 50mM 2-mercaptoethanol, 
lOOu/ml penicillin, 100f*g/ml streptomycin, 100mM HEPES and 
5% foetal calf serum. Cells (2xl0 5 /well) were cultured 
with antigen and proliferation was measured by 

30 incorporation of [ 3 H] -thymidine as described above. 

Antigens were SA I/II recombinant polypeptides, peptides 
spanning residues 975-1044 and a control protein fraction 
from E. coli harbouring non- recombinant plasmid. 

As in Fig 7, all mouse strains responded to SA I/II 
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and the recombinant polypeptide 975-1044 (SEQ. ID. No. 8) . 
Positive responses to peptides were those of stimulation 
index >3.0 (cpm>500) . SJL mice responded to peptide 985- 
1004 and DBA/a mice responded to peptide 975-995 and 985- 
5 1004. For BALB/c mice, no significant responses to 

peptides were observed although the response to peptide 
985-1004 was greater than responses to the remaining 
peptides. 

iii) Antibody recognition (See Table 2) 

10 Sera from mice immunised intraperitoneally with 

polypeptide 975-1044 recognised intact cells of S.mutans, 
intact SA I/II and recombinant 975-1044. Peptides 995-1014 
and 1025-1044 were also recognised. The titre for each 
strain was as in Table 2, which shows log 2 titres where 

15 initial dilution was 1 in 50 (titre = 1) . 
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Example 5 

Analysis of the interaction between streptococcal antigen 
5 I/XX and salivary receptor using BIAcore 

AIMS 

In this study, we have used surface plasmon resonance (sp?r) 
to analyse the interaction between purified SA I/II and 
10 whole human saliva or purified salivary receptor. In 
addition we have investigated the calcium dependence of 
binding, identified individual amino acid residues which 
may be involved in binding and determined the affinity of 
the interaction between SA I/II and salivary receptor. 

15 

METHODS 
Materials 

SA I/II and recombinant polypeptides were prepared as 
described above. Salivary receptor was prepared by 

20 absorption of whole saliva with intact cells of S. /nutans 
(Lee et al (1989) Infect. Immun. 57:3306-3313). The cells 
were washed with KPBS (2.7 mM KC1, 137mM NaCl in 1.5mM 
KH 2 P0 4 , 6.5mM NajHPO,, pH 7. 2) and adsorbed material was 
eluted with ImM EDTA in KPBS. Analysis of the purified 

25 material by polyacrylamide gel electrophoresis in the 

presence of Na dodecyl sulphate indicated the presence of 
components of Mr>200 / 000 and approximately 40,000. Peptides 
were prepared by the simultaneous multiple peptide 
synthesis procedure (Houghten (1985) Proc. Natl. Acad. Sci. 

3C USA 82: 5131-5135) as above. In addition, a series of 

peptides was synthesised corresponding to residues 1025- 
1044 in which each residue in turn was substituted by 
alanine . 
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Binding Analyses 

Purified SA I/II or salivary receptor was immobilised on 
the sensor chip surface at a concentration of 100 /ig/ml in 
lOmM Na formate pH 3.5 using the amine coupling kit 
5 (Pharmacia Biosensor) . 
1. Inhibition studies 

Binding of immobilised SA I/II to receptors in whole saliva 
was determined in the absence and presence of inhibitors 
(at varying concentrations) . Inhibition by alanine- 

10 substituted peptides was analysed at a peptide 

concentration of 50 /xM. The running buffer was HEPES 
buffered saline (HBS) and the surface was regenerated with 
lOOmM HCl. 
ii. Direct binding 

15 Purified salivary receptor was immobilised on the sensor 
chip and binding of SA I/II or purified recombinant 
polypeptide fragments was determined. 

RESULTS 
20 i. Calcium dependency 

In separate determinations with whole saliva, binding to 
immobilised SA I/II varied from approximately 250 resonance 
units (RU) - 800 RU. In the presence of EDTA, binding was 
inhibited with maximal inhibition of 95% at a concentration 
25 of lOmM EDTA. Subsequent binding assays were performed in 
the presence of 5mM calcium, 
ii. Inhibition of binding 

Purified SA I/II or recombinant polypeptide fragments 1 
(residues 39-481), 2 (residues 475-824), 3 (residues 816- 
30 1213), 4 (residues 1155- 1538) and recombinant 984-1161 

were added to fluid phase saliva as competitive inhibitors 
at concentrations varying from 0-20/iM. SA I/II inhibited 
binding most efficiently with approximately 90% inhibition 
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at a concentration of 6 jxM (Fig. 8) . Of the recombinant 
fragments, only fragment 3 and r984-1161 inhibited binding 
co salivary receptors to a signif icantly greater extent 
than the control (bovine serum albumin) with maximal 
5 inhibition of 65% and 50% , respectively (Fig. 8) . 

A panel of synthetic peptides (20mers overlapping by 10) 
spanning residues 803-1174 was assayed for inhibitory 
activity- Peptide 1025-1044 was the most effective 

10 inhibitor although 10-20 fold higher concentrations were 
required than for polypeptides (Fig. 9) . A panel of 
peptides in which each of the residues 1025-1044 in turn 
were substituted with alanine (alanine was substituted by 
serine where it occurred naturally) was also analysed for 

15 inhibitory activity. Substitution of Glu (1037) 

consistently abolished inhibition mediated by the peptide 
(Fig. 10) . Similarly, substition of Gin 1025, Thr 1039, Phe 
1041, Val 1042, Leu 1043 and Val 1044 reduced the 
inhibition of binding which was mediated by the peptide 

20 1025-1044. 

iii. Direct Binding 

For these analyses, purified salivary receptor was 
immobilised on the sensor chip and binding to fluid phase 
SA I/II or recombinant polypeptides was determined. At a 

25 concentration of approximately 5 (M both SA I/II and 
recombinant SA I/II bound to salivary receptor in the 
range 500-600 RU (Fig. 11) . Binding of recombinant 
polypeptides was determined at a concentration of 
approximately 20/xM and highest binding was obtained with 

30 fragment 3 (1256 RU) (Fig. 11) . Binding of other fragments 
although significantly greater than the myosin control was 
not greater than the bovine serum albumin control and thus 
does not appear to be specific. Addition of EDTA (10 /iM) in 
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this assay completely inhibited binding of fluid phase SA 
I/II. 

Affinity and rate constants for the adhesir. -receptor 
interaction were determined for SA I/II, recombinant SA 
5 I/IT and fragment 3 (Table 3). The values indicate a low 
affinity interaction with a slow association rate constant 
and a relatively rapid dissociation constant . 
Conclusions 

These analyses confirm that residues 816-1213 of SA I/II 
10 form an adhesion binding region and that within this 

region, peptide 1025-1044 forms an adhesion epitope. We 
have now extended these findings by identifying specific 
residues which may be essential for binding to salivary 
receptor, namely residues 1025, 1037, 1039 and 1041-1045. 
15 The binding is EDTA sensitive and, under the assay 
conditions, is of relatively low affinity. 

* 

TABLE 3 

20 





SA I/II 


recomb.SA I/II 


FRAG 3 


K 


(M* 1 s*" 


n.d. 


20.9 x 10 3 


1.5 x 10 3 




(s- 1 ) 


2 x 10* 2 


4.2 x 10° 


8.1 x 10*' 


K A 


(M* 1 ) 


n.d. 


5.0 x 10* 


0.2 x 10 s 



25 



n.d. not determined 
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agflHB8£E INFORMATION 

As a result of the experiments detailed above, the 
following sequences have been identified as being of 
5 particular interest. 

(i) Residues 925 to 1114 (SEQ. ID. No. 1). This sequence 
comprises sequences (iv) and (v) below and includes 2 
series of overlapping T-cell, B-cell and adhesion 

10 epitopes, a further B-cell epitope, a further T-cell 

epitope and an adhesion site. 

SEQ . ID. No. 1: 

1 5 TEKPLEPAPVEPS YEAEPTPPTPTPDQPEPNKPVEPTYEVI PTPPTDPVYQDLPTPPS I 
PTVHFHYFKLAVQPQVNKEIRNNNDWIDRTLVAKQSVVKFQLKTADLPAG V 
LVDPLPSGYQFNPEATKAASPGFDVAYDNATNTVT 
TWGQVLNDGATY 

20 Its DNA sequence is (SEQ. ID. No. 12): 

ACAGAAAAGCCGTTGGAGCCAGCACCTGTTGAGC 

CCGACACC^CACCAGATCAACCAGAAC<3U^CAAAC 

ATTCCAACACCGCCGACTGATCCTGTTTATCAAGATCT^ 

2 5 ACTGTTCATTTCCATTACrTTAAACrAGCTGT^ 

AACJ^TAACGATGTTAATATTGACAGAACTTTGGTGG 
CAGCTGAAGACAGCAGATCrCCCTGCTGGACGTGATGAAACAACTT 
GATCCCCTGCCATCTGGTTATCAATTTAATCCTGAAGCTACAAAAGCT 
TTTGATGTCGCTTATGATAATGCAACTAATACAGTCACCTTCAAGGC^ 

3 0 TTGGCTACGTTTAATGCTGATTTGACTAAGTCACT 

GGACAAGTTCTTAATGATGGCGCAACTTAT 

(ii) Residues 1005 to 1044 (SEQ. ID. No. 2) . This comprises 
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a T-.cell epitope overlapping two adhesion sites. 



SEQ. ID. No. 2 
5 NNNDVNIDRTLVAKQSVVKFQLKTADLPAGRDETTSFVLV 

Its DNA sequence is (SEQ. ID. No. 13) : 
AACAATAACGATGTTAATATTGACAG 

CAGCTGAAGACAGCAGATCTCCCTGCTGGACGTGATGAAACAACTTCCTTTG 

10 

(iii) Residues 1085-1104 (SEQ. ID. No. 3). Here, a T-cell 
epitope, a B-cell epitope and an adhesion site 
overlap . 
SEQ. ID. No. 3: 
15 LATFNADLTKS VAT I YPTW 

Its DNA sequence is (SEQ. ID. No. 14) : 
TTGGCTACGTTTAATGCTGATTTGACTAAGTCAGTG 

20 (iv) Residues 1005 to 1114 (SEQ. ID. No. 4) . This comprises 
sequences (ii) and (iii) above and therefore includes 
two sequences in which a B-cell epitope a T-cell 
epitopes and an adhesion site overlap. 

25 SEQ. ID. No. 4 

NNNDWIDRTLVAKQSVVKFQLKTADLPAGRDE 

D VAYDNATNTVTFKATAATLATFNADLTKS VAT I YPTWGQVLNDGATY 

30 Its DNA sequence is (SEQ. ID. No. 15) : 

AACAATAACGATGTTAATATTGACAGAACTTTGGTC^ 
CAGCTGAAGACAGCAGATCTCCCTGCTGGACGTGATGAAACAA 
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GATCCCCTGCCATCTGGTTATCAATTTAATCCT^ 
TTTGATGTCGCTTATGATAATGCAACTAATACAGTC^ 
TTGGCTACGTTTAATGCTGATTTGACTAAG 
GGACAAGTTCTTAATGATGGCGCAACTTAT 

5 

(v) Residues 925 to 1004 (SBQ. ID. No. 5) . This comprises 
a B-cell epitope,, an innnunodominant T-cell epitope and 
an adhesion site. 

10 SEQ. ID. No. 5: 

TEKPLEPAPVEPSYEAEPTPPTPTPDQPEPNKPVEPTYEVIPTPPTDPVYQDLPTPPSIPT 
VHFHYFKIiAVQPQVNKEIR 

15 Its DNA sequence is (SEQ. ID. No. 16): 

ACAGAAAAGCCGTTGGAGCCAGCACCTGTTGAGCC^ 

CCGACACCAACACCAGATCAACCAGAACCAAAC^ 

ATTCCAACACCGCCGACTG^TCCTGTTTATCAA 

2 0 ACTGTTCATTTCGAITACTTTAAACT^^ 

(vi) Residues 925 to 1054 (SEQ. ID. No. 6) . This comprises 
sequence (v) above, together with a further adjacent 
adhesion site and a further overlapping B-cell epitope. 

25 

SEQ ID. No. 6 

TEKPLEPAPVEPSYEAEPTPPTPTPDQPEPNKPVEPTYEVI PTPPTDPVYQDLPTPPS I PT 
VHFHYFKLAVQPQVNKEIRNNNDVNIDRTLVA^ 

3 0 LPSGYQFN 

Its DNA sequence is (SEQ. ID. No. 17) : 
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ACAGAAAAGCCCnTGGAGCCAGCACCTGTTGAGCCAAGCT 
CCGACACGAACACCAGATCAACCAGAACCT^ 
ATTCCAAC^CCGCCGACTGATCCT^ 
ACTGTTCATTTCCATTACTTTAAACTAGCTGTTC^ 
5 AACAATAACGATGTTAATATTGACAGAACIT^ 

CAGCTGAAGACAGCAGATCTCCCTGCTGGACGTGATGAAAC^ 

GATCCCCTGCCATCTGGTTATCAATTTAAT 

10 <vii) Residues 803-854 (SEQ. ID. No. 7) . This comprises e. 

major T-cell epitope and adjacent immunodominant B- 
cell epitope. 

SEQ. ID. No. 7: 

15 

ETGKKPNIWYSLNGKIRAVNLPKVTKEKPTPPVKPTAPTKPTYETEKPLKPA 
Its DNA sequence is (SEQ. ID. No. 18) 

2 0 GAAACCGGCAAAAAACCAAATATTTGGTATTC^ 

CTTCCTAAAGTTACTAAGGAAAAACCCACACCTC 
CCAACTTATGAAACAGAAAAGCCATTAAAACCGGCA 

(viii) Residues 975 to 1044 (SEQ. ID. No. 8) . This 
25 comprises a T-cell epitope, a B-cell epitope and an 

adhesion site. 

SEQ. ID No. 8: 

3 0 QDLPTPPSIP7VHFHYFKLAVQPQVNKEIRNNN 

DETTSFVLV 

Its DNA sequence is (SEQ. ID. No. 19) : 
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CAAGATCTTCCAACACCTCCATCTATACCAACTGTC 

GTTCAGCCGCAGGTTAACAAAGAAATTAGAAACAATAACGATGTTAATATT 

TTGG'I^CTAAACAATCTGTTGTTAAGTT 

CGTGATGAAACAACTTCCTTTGTCTTGGTA 

5 

(ix) Residues 1024 to 1044 (SEQ. ID. No. 9) . This comprises 
a T-cell epitope overlapping with an adhesion site. 

SEQ. ID. No. 9 
1 0 FQLKTADLPAGRDETTSFVLV 

Its DNA Sequence is (SEQ. ID. No. 20) : 

TTCCAGCTGAAGACAGCAGATCTCCCTGCTGGACGTG 
15 GTA 

(x) Residues 803 to 1114 (SEQ. ID. No. 10) . This comprises 
sequences (i) and (vii) above and some intervening 
sequence. Residues 803 to 1114 comprise 2 series of 

20 overlapping T-cell, B-cell and adhesion epitopes, a 

further T-cell epitope and a further adhesion site and 
an immunodominant B-cell epitope and a major T-cell 
epitope. 

25 SEQ. ID. No. 10: 

ETGKKPNIWYSLNGKIRAVNLPKVTKEKPT^ 
APNYEKEPTPPTRTPDQAEPKXPTPPTYETEKPLEPAP 

PNKPTPPTYETEKPLEPAPVEPSYEAEPTPPTPTPDQPEPNKPVEPTYEVIPTPPTDP 
VYQDLPTPPSIPTVHFHYFKLAVQPQVNKEIRNNNDV 
3 0 LPAGRDETTSFVLVDPLPSGYQFNPEATKAASPGFDVAYDNAT 
NADLTKSVAT I YPTWGQVLNDGATY 

Its DNA Sequence is (SEQ. ID. NO. 21): 
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GAAACCXKSCAAAAAACCAAATATTTGGTAT^ 

CTTCCTAAAGTTACTAAGGAAAAACCCACACCT^ 

CCAACTTATGAAACAGAAAAGCCATTAAAACCGG^ 

GAGCCAACACCACCGACAAGAACACCGGATCAAGCAGAGCC^^ 
5 ACCTATGAAACAGAAAAGCCGTTGGAGCCAGCACCTGTTGAGCCAAGCrAT 

CCAACACCGCCGACAAGGACACCGGATCAGGC^ 

TATGAAACAGAAAAGCC^TTGGAGCCAGCACCTGTTGAGCCAAGCT 

ACGCCACCGACACCAACACCAGATCAACCAGAACCAAACAAACCTG^ 

GAGGTTATTCCAACACCGCCGACTGATCCTGT^ 
1 0 ATACCAACTGTTCATTTCCATTACITTAAACTAGCT 

ATTAGAAACAATAACGATGTTAATATTGACAGAACTTTGGTGGCTAAACAATCT 

AAGTTCCAGCTGAAGACAGCAGATCTCCCTGC^^ 

TTGGTAGATCCCCTGCCATCTGGTTATCAAT^ 

CCTGGCTTTGATGTCGCTTATGATAATGC^ 
1 5 GCAACTTTGGCTACGTTTAATGCTGATTTGACTAAGTCAGTGGCAACGA 

GTGGTCGGACAAGTTCTTAATGATGGCGCAACTTAT 

(xi) Residues 975 to 1004 (SEQ. ID. No, 11), which comprise 
a T-cell epitope. 

20 

SEQ. ID. No. 11: 
QDLPTPPS I PTVHFHYFKLAVQPQVNKE I R 

Its DNA Sequence is (SEQ. ID. NO. 22) : 
2 5 CAAGATCTTCCAACACCTCCATCTATACCAACT 
GTTCAGCCGCAGGTTAACAAAGAAATTAGA 

The amino acid sequence of SA I /II is as follows, beginning 
with residue No. 1 (SEQ ID No. 23) . 

30 

MKVKKTYGFRKSKI SKTLCGAVLG WAAVS VAGQKVFADETTTT 
SDVDTKWGTQTGNPATNLPEAQGSASKQAEQSQTKLERQMW 
AKSAGVNWQDADVNKGTVKTAEEAVQKETE IKEDCT 
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AEVAKIKAKNQATKEQYGKDMVAHKAEVE^ 

TNAANQASYQKALAAYQAELKRVQEANAAAK^ 

NATAKAEYETKLAQYQAELKRVQEANAANEADYQAKLTAYQ 

EAAVAANNAKNAALTAENTAI KQRNENAKATYEAALKQYEADLAAVKKANAANEADYQ 
5 AKLTAYQTELARVQKANADAKAAYEAAVAANNAA^ 

KLAKYQADLAKYQKDLADYPVKLKAYEDEQAS I KAALAELEKHKNEDGNLTEPSAQNL 

VYDLEPNANLSLTTDGKFLKASAVDDAFSKSTSKAKYDQ^ 

ASSMELYGNFGDKAGWSTTVSNNSQVKWGSVLL^ 

I VYKYTVDPKSKFQGQKVWLG I FTDPTLGVFASAYTGQVEKNTS I F I KNEFTFYDEDG 
10 KP INFDNALLSVASLNRENNS I EMAKDYTGKFVKI SGSS IGEKNGMI YATDTLNFRQG 

QGGARWTMYTRASEPGSGWDSSDAPNSVTCGAGAIRM^ 

TMAIETGKKPNIWYSLNGKIRAVNLPKVTKEKPTPPVKPTAP 

APNYEKEPTPPTRTPIX3AEPKKPTPPTYETEKPLEPAPVEPSYEAEPTPPTR 

PNKPTPPTYETEKPLEPAPVEPSYEAEPTPPTPTPDQPEPNKPVEPTYEVIPTPPTD 
1 5 VYQDLPTPPS I PTVHFHYFKLAVQPQVNKEIRNNNDVNIDRTLVAKQSVVKFQLCT 

LPAGRDETTSFVLVDPLPSGYQFNPEATKAASPGFDVAYDNATNTVTFKATAATIATP^ 

NADLTKSVATIYPTWGQVLNIXSAT^ 

PNNNYIKPTKVNKNENGRAfllXSKTVLA^ 

VDDYPEEALEI^QDLVKITDANGNEVTGVSVDNYTSLEAAPQEIRDVLSKAGIRPKGA 
20 FQ I FRADNPREFYDTYVKTG I DLKI VS PMWKKQMGQTGGS YEDQAYQI DFGNGYASN 
IVINNVPKINPKKDVTLTLDPAD^ 

NFYDD YDQTGDHYTGQYKVFAKVD I TLKNGV 1 1 KSGTELTQYTTAEVDTTKGAI TI KF 
KEAFLRS VS I D S AFQAE S Y I QMKR I AVGTFENTY INTVNGVTYS SNTVKTTT PEDPAD 
PTDPQDPSSPRTSTVIIYTCPQSTAYQPSSVQKTLPNTGVTNNAYMPL^ 
25 LGLKAKKD 

Its DNA sequence is as follows (SEQ ID No. 24) : 



1 ATTTCAGCAA AAATTGACAA ATCAAATCAA TTATATTACA ATTTTTTAAC 

30 51 GTATATTACA AAAATATATT TGGAAGATTT ATTCAGATTT GGAGGATTTA 

101 TGAAAGTCAA AAAAACTTAC GGTTTTCGTA AAAGTAAAAT TAGTAAAACA 

151 CTGTGTGGTG CTGTTCTAGG AACAGTAGCA GCAGTCTCTG TAGCAGGACA 

201 AAAGGTTTTT GCCGATGAAA CGACCACTAC TAGTGATGTA GATACTAAAG 
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251 


TAGTTGGAAC 




301 


GGAAGTGCGA 




351 


AATGGTTCAT 




401 


AAGATGCTAA 


5 


451 


AAAGGAACTG 




501 


TAAAGAAGAT 




551 


AATATAAATC 




601 


GCTAAAAATC 




651 


TAAAGCCGAG 


10 


701 


CTTATGAAGC 




751 


AAAACCAATG 




801 


TCAGGCTGAA 




851 


CTTATGATAC 




901 


GCTGCCAATG 


15 


951 


TGAGACTAAG 




1001 


CTAATGCCGC 




1051 


ACAGAGCTTG 




1101 


TGAAGCAGCT 




1151 


AAAATACTGC 


20 


1201 


GCTGCACTCA 




1251 


TGCCGCAAAC 




1301 


AGCTCGCTCG 




1351 


GCAGCTGTAG 




1401 


TACTGCAATT 


25 


1451 


AACTTGCTAA 




1501 


GACTATCCAG 




1551 


AGCTGCACTG 




1601 


CAGAACCATC 




1651 


TTATCTTTGA 


30 


1701 


TGCTTTTAGC 




1751 


AATTAGATGA 




1801 


TCTTCTATGG 




1851 


GACAGTAAGC 



ACAAACTGGA AATCCAGCGA 
GTAAGCAAGC TGAACAAAGT 
ACCATTGAAG TACCTAAAAC 
GTCTGCTGGT GTCAATGTTG 
TTAAAACAGC TGAAGAAGCA 
TACACAAAAC AAGCTGAGGA 
GGATGTAGCT GCTCATGAGG 
AGGCAACTAA AGAACAGTAT 
GTTGAACGCA TTAATGCTGC 
TAAATTGGCT CAATATCAAG 
CTGCCAATCA AGCATCCTAT 
CTGAAACGTG TTCAGGAAGC 
TGCTGTAGCA GCAAATAATG 
AAGAAATTAG AAAACGCAAT 
TTAGCTCAAT ATCAAGCTGA 
AAACGAAGCA GACTATCAAG 
CTCGCGTTCA GAAAGCCAAT 
GTAGCAGCAA ATAATGCCAA 
AATTAAGCAA CGCAATGAGA 
AGCAATATGA GGCTGATTTG 
GAAGCAGACT ATCAAGCTAA 
CGTTCAAAAG GCCAATGCGG 
CAGCAAATAA TGCCGCAAAT 
AAGAAGCGCA ATGCGGATGC 
GTATCAAGCA GATCTTGCCA 
TTAAGTTAAA GGCATACGAA 
GCAGAACTTG AAAAACATAA 
TGCTCAAAAT TTGGTCTATG 
CAACAGATGG GAAGTTCCTT 
AAAAGCACTT CAAAAGCAAA 
TCTAGATATC ACTAACTTAG 
AGCTTTATGG CAATTTTGGT 
AATAACTCAC AGGTTAAATG 



CCAATTTGCC AGAGGCTCAA 
CAAACCAAGC TGGAGAGACA 
TGATCTTGAT CAAGCAGCAA 
TCCAAGATGC CGATGTTAAT 
GTCCAAAAAG AAACTGAAAT 
TATTAAGAAG ACAACAGATC 
CAGAAGTTGC TAAAATCAAA 
GGAAAAGATA TGGTAGCTCA 
AAATGCTGCC AGTAAAACAG 
CAGATTTAGC AGCCGTTCAA 
CAAAAAGCCC TTGCTGCTTA 
TAATGCAGCC GCCAAAGCCG 
CCAAAAATAC AGAAATTGCC 
GCAACGGCCA AAGCTGAATA 
ACTAAAGCGT GTTCAGGAAG 
CTAAATTGAC CGCCTATCAA 
GCAGATGCTA AAGCGGCCTA 
AAATGCGGCA CTTACAGCTG 
ATGCTAAGGC GACTTATGAA 
GCAGCGGTGA AAAAAGCTAA 
ATTGACCGCC TATCAAACAG 
ATGCTAAAGC GGCCTATGAA 
GCAGCGCTCA CAGCTGAAAA 
TAAAGCTGAT TACGAAGCAA 
AATATCAAAA AGATTTAGCA 
GATGAACAAG CTTCTATTAA 
AAATGAAGAC GGAAACTTAA 
ATCTTGAGCC AAATGCGAAC 
AAGGCTTCTG CTGTGGATGA 
ATATGACCAA AAAATTCTTC 
AACAATCTAA TGATGTTGCT 
GATAAAGCTG. GCTGGTCAAC 
GGGATCGGTA CTTTTAGAGC 
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LuLAAuAuL 1 






r*^*ee a r i * t " i 1 


r^A anrTPR a a 

L-HAvjLj 1 LAAA 






A ATYIT'T A AAA 


AVjAI 1 IL 1 aa 






LI 1 IAItoIIsI 


111 ibtll LL 






1 L. 1A1 111 1A 


TTA A A A ATVA 

1 1AAAAA1GA 




Ol CI 


AA1 1AA1 111 


(T* ATA RTPPPP 

LxAIAAIXjULL 




2201 


A1AA1 1U1AT 


TGAGATGG CC 




2251 


TPTPP 7V TV"* 74 rp 

TLloUA 1 UA1 


CTATCGGTGA 






TCTCAACTTT 


AGGCAGGGTC 


10 


2351 


7\ /-I /-Mil 7\ /">0 TV 

GAG CTAGCGA 


ACCXSGGATCT 




2401 


TGGTATGGTG 


CTGGTGCTAT 




2451 


TTTGGGTGCT 


74 fT»/ t>T>/~« 74 m^"<74 74 

ATCTCATCAA 




2501 


caattgaaac 


CGGCAAAAAA 




2551 


ATCCGTGCGG 


TTAATCTTC C 


lb 


2601 


GGTTAAACCA 


AGAGCTCGAA 




2651 


taaaaccggc 


74 TV /^fT>7i /^/wi 

ACCAGTAGCT 




2701 


TV /"^74 7V /"* 7V 7V <"»74 /"^ 

ACAAGAACAC 


CGGAT CAAGC 




2751 


m/^l 74 74 71 ^74 r"»74 74 

TGAAACAGAA 


AAGCCGTTGG 




O O A 1 

2B01 


CAGAGCCAAC 


ACCGCCGACA 


20 


2o51 


CCCACACC.GC 


CGACCTATGA 




2 y ui 


1 CjACjL CAAvjC 


1 A 1 OAAG UAvj 






a a r»r* a r* a a r*o 
AAL LAuAALL 


aaa r* a a a rr^r 
AAALAAALL 1 






emend a r*r^ 


AlLLiul 1 1 A 






AAL l\j 1 1 LAI 


1 ILLAi 1AL1 


25 


3101 


AAGAAATTAG 


AAACAATAAC 




3151 


AAACAATCTG 


TTGTTAAGTT 




3201 


ACGTGATGAA 


ACAACTTCCT 




3251 


ATCAATTTAA 


TCCTGAAGCT 




3301 


GCTTATGATA 


ATGCAACTAA 


30 


3351 


TTTGGCTACG 


TTTAATGCTG 




3401 


CAACAGTGGT 


CGGACAAGTT 




3451 


TTCTCGCTCA 


CAGTCAATGA 




3501 


GGTGACAACT 


CCTGGTAAAC 



ACATACACTA ACCTGCAGAA TTCTTATTAC 
AGGTTTGGTT AGGTATTTTT ACCGATCCAA 
AATTGTCTAC AAGTATACAG TGGACCCTAA 
GCTTATACAG GTCAAGTTGA AAAAAACACT 
ATTCACTTTC TATGACGAAG ATGGAAAACC 
TTCTATCAGT AGCTTCTCTT AACCGAGAAA 
AAAGATTATA CGGGTAAATT TGTCAAAATC 
AAAGAATGGC ATGATTTATG CTACAGATAC 
AAGGTGGTGC TCGTTGGACC ATGTATACCA 
GGCTGGGATA GTTCAGATGC GCCTAACTCT 
CCGCATGTCT GGTCCTAATA ACAGTGTGAC 
CACTTGTTGT GCCTGCTGAT CCTACAATGG 
CCAAATATTT GGTATTCATT AAATGGTAAA 
TAAAGTTACT AAGGAAAAAC CCACACCTCC 
CTAAACCAAC TTATGAAACA GAAAAGCCAT 
CCAAATTATG AAAAGGAGCC AACACCACCG 
AGAGCCAAAG AAACCCACTC CGCCGACCTA 
AGCCAGCACC TGTTGAGCCA AGCTATGAAG 
AGGACACCGG ATCAGGCAGA GCCAAATAAA 
AACAGAAAAG CCGTTGGAGC CAGCACCTGT 
AGCCAACGCC ACCGACACCA ACACCAGATC 
GTTGAGCCAA CTTATGAGGT TATTCCAACA 
TCAAGATCTT CC7UVCACCTC CATCTATACC 
TTAAACTAGC TGTTCAGCCG CAGGTTAACA 
GATGTTAATA TTGACAGAAC TTTGGTGGCT 
CCAGCTGAAG ACAGCAGATC TCCCTGCTGG 
TTGTCTTGGT AGATCCCCTG CCATCTGGTT 
ACAAAAGCTG CCAGCCCTGG CTTTGATGTC 
TACAGTCACC TTCAAGGCAA CTGCAGCAAC 
ATTTGACTAA GTCAGTGGCA ACGATTTATC 
CTTAATGATG GCGCAACTTA TAAGAATAAT 
TGCTTATGGC ATTAAATCCA ATGTTGTTCG 
CAAATGATCC AGATAACCCA AATAATAATT 
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3551 


ACATTAAGCC 


AACTAAGGTT 




3601 


GGTAAAACAG 


TTCTTGCCGG 




3651 


TTTGGATCAA 


TATAAAAACG 




3701 


GATTTTACTA 


TGTAGATGAT 


5 


3751 


GATTTAGTGA 


AGATTACAGA 




3801 


TGTGGATAAT 


TATACTAGTC 




3851 


TTCTTTCTAA 


GGCAGGAATT 




3 901 


GCCGATAATC 


CAAGAGAATT 




3 951 


TTTGAAGATT 


GTATCACCAA 


10 


4001 


GCGGGAGTTA 


TGAAGATCAA 




4051 


GCATCAAATA 


TCGTTATCAA 




4101 


TGTGACCTTA 


ACACTTGATC 




4151 


CTATTCCACT 


TAATACAGTC 




4201 


CCTGCAAATC 


ACTCAGAAGA 


15 


4251 


TGATCAAACA 


GGAGATCACT 




4301 


TTGATATCAC 


TCTTAAAAAC 




4351 


ACTCAGTATA 


CGACAGCGGA 




4401 


TAAGTTCAAG 


GAAGCCTTTC 




4451 


AAGCTGAAAG 


TTATATCCAA 


20 


4501 


AATACCTATA 


TTAATACTGT 




4551 


GAAAACAACT 


ACTCCTGAGG 




4601 


CATCATCACC 


GCGGACTTCA 




4651 


GCTTATCAAC 


CAAGCTCTGT 




4701 


AAACAATGCT 


TATATGCCTT 


25 


4751 


TTAGTTTGCT 


TGGCTTAAAG 




4801 


ATTAGAATTA 


AAAAGTGAGA 




4851 


TCTCATTTTT 


TGATT 



AATAAAAATG AAAATGGCGT TGTTATTGAT 
TTCAACOAAT TATTATGAGC TAACTTGGGA 
ACCGCTCTTC AGCAGATACC ATTCAACAAG 
TATCGAGAAG AAGCGCTTGA ATTGCGTCAG 
TGCTAATGGC AATGAAGTTA CTGGTGTTAG 
TTGAAGCAGC CCCTCAAGAA ATTAGAGATG 
AGACCTAAAG GTGCTTTCCA AATTTTCCGT 
TTATGATACT TATGTCAAAA CTGGAATTGA 
TGGTTGTTAA AAAACAAATG GGACAAACAG 
GCTTACCAAA TTGACTTTGG TAATGGTTAT 
TAATGTTCCT AAGATTAACC CTAAGAAAGA 
CGGCTGATAC AAATAATGTT GATGGTCAGA 
TTTAATTACC GTTTGATTGG TGGCATTATC 
ACTCTTTGAA TACAATTTCT ATGATGATTA 
ATACTGGTCA GTATAAAGTT TTTGCCAAGG 
GGTGTTATTA TCAAGTCAGG TACTGAG1TA 
AGTTGATACC ACTAAAGGTG CTATCACAAT 
TGCGTTCTGT TTCAATTGAT TCAGCCTTCC 
ATGAAACGTA TTGCGGTTGG TACTTTTGAA 
CAATGGGGTA ACTTACAGTT CAAATACAGT 
ATCCTGCAGA CCCTACTGAT CCGCAAGATC 
ACTGTAATTA TCTACAAACC TCAATCAACT 
CCAAAAAACG TTACCAAATA CGGGAGTAAC 
TACTTGGTAT TATTGGCTTA GTTACTAGTT 
GCTAAGAAAG ATTGACAGCA TAGATATTAC 
TGAAGCGATA AATCACAGAT TGAGCTTTTA 
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CLAIMS 

1. A nucleic acid sequence which codes upon expression in 
5 a prokaryotic or eukaryotic host cell for a polypeptide 

product having one or more properties selected from (i) the 
ability to adhere to a" mammalian tooth in a competitive 
manner with naturally occurring Streptococcus mutans antigen 
1/11, thus preventing or diminishing the adhesion of S. mutans 
10 to the tooth; (ii) the ability to stimulate a T-cell 
response; and (iii) the ability to stimulate a B-cell 
response, said nucleic acid sequence being selected from: 

(a) the sequences shown in SEQ. ID. No. 12 to 22 or the 
complementary strands thereof; 
15 (b) nucleic acid sequences having a length of not more 

than 1000 base pairs which hybridise to the sequences 
defined in (a) over at least 70% of their length; 
(c) nucleic acid sequences having a length of not more 
than 1000 base pairs which, but for the degeneracy of 
20 the genetic code, would hybridise to the nucleic acid 

sequences defined in (a) or (b) over at least 70% of 
their length and which sequences code for polypeptides 
having the same amino acid sequence. 

25 2. A nucleic acid sequence as defined in claim 1 which is 
extended at either or both of the 5' and 3' ends by non- wild- 
type nucleic acid sequence. 

30 3. A polypeptide encoded by a nucleic acid sequence as 
defined in claim 1 or 2. 

4 . A polypeptide according to claim 3 comprising a 
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sequence as shown in any one of SEQ. ID. No. 1 to 11. 

5. A vector comprising a nucleic acid sequence according 
5 to claim 1 or 2 operably linked to a promoter capable of 

directing its expression in a host cell. 

6. A Cell harbouring a vector according to claim 5. 

10 7. A pharmaceutical composition comprising a polypeptide 
according to claim 3 or 4 and a pharmaceutically acceptable 
carrier . 

8. A composition comprising according to claim 7 that is a 
15 vaccine composition. 

9. A polypeptide according to claim 3 or 4 for use in the 
treatment of dental caries. 

20 10. A method of vaccinating a mammalian host against dental 
caries or treating dental caries, which method comprises 
administering to the host an effective amount of a 
polypeptide according to claim 3 or 4 or a composition 
according to claim 8 or 9. 
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